Data Anonymization: An Experimental Evaluation Using Open-Source Tools
نویسندگان
چکیده
In recent years, the use of personal data in marketing, scientific and medical investigation, forecasting future trends has really increased. This information is used by government, companies, individuals, should not contain any sensitive that allows identification an individual. Therefore, anonymization essential nowadays. Data changes original to make it difficult identify ARX Anonymization Amnesia are two popular open-source tools simplify this process. paper, we evaluate these ways: with OSSpal methodology, using a public dataset most tweets about Pfizer BioNTech vaccine. The assessment methodology determines better results than Amnesia. experimental evaluation dataset, possible verify some errors limitations, but process simpler. Using Anonymization, upload big datasets tool does show error We concluded one recommended anonymization.
منابع مشابه
Evaluation of Data Anonymization Tools
This survey became possible due to coming request of one of Siemens Business Units to look for data anonymization solutions being presented in the market today. The customer plans to implement and deploy it within software development projects to provide offshore team with a fully functional environment without any critical data in it. Critical data are, for instance, Personal Identifiable Info...
متن کاملSecGraph: A Uniform and Open-source Evaluation System for Graph Data Anonymization and De-anonymization
In this paper, we analyze and systematize the state-ofthe-art graph data privacy and utility techniques. Specifically, we propose and develop SecGraph (available at [1]), a uniform and open-source Secure Graph data sharing/publishing system. In SecGraph, we systematically study, implement, and evaluate 11 graph data anonymization algorithms, 19 data utility metrics, and 15 modern Structure-base...
متن کاملOpen-source tools for data mining.
With a growing volume of biomedical databases and repositories, the need to develop a set of tools to address their analysis and support knowledge discovery is becoming acute. The data mining community has developed a substantial set of techniques for computational treatment of these data. In this article, we discuss the evolution of open-source toolboxes that data mining researchers and enthus...
متن کاملAn Evaluation of Open Source Unit Testing Tools Suitable for Data Warehouse Testing
Verification and validation are two important processes in the software system lifecycle. Despite the importance of these processes, a recent survey has shown that testing of data warehouse systems is currently neglected. The survey participants named besides others modest budget and the lack of appropriate tools as potential reasons for this circumstance. In order to verify these reasons, the ...
متن کاملOpen source software - an evaluation
The success of Linux and Apache has strengthened the opinion that the open source paradigm is one of the most promising strategies to enhance the maturity, quality, and efficiency of software development activities. This observation, however, has not been discussed in much detail and critically addressed by the software engineering community. Most of the claims associated with open source appea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Future Internet
سال: 2022
ISSN: ['1999-5903']
DOI: https://doi.org/10.3390/fi14060167